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Abstract. A priority queue is presented that supports the operations 
insert and find-min in worst-case constant time, and delete and delete- 
min on element x in worst-case 0(\g(min{w x , q x } + 2)) time, where w x 
(respectively q x ) is the number of elements inserted after x (respectively 
before x) and are still present at the time of the deletion of x. Our priority 
queue then has both the working-set and the queueish properties, and 
more strongly it satisfies these properties in the worst-case sense. We also 
define a new distribution-sensitive property — the time-finger property, 
which encapsulates and generalizes both the working-set and queueish 
properties, and present a priority queue that satisfies this property. 
In addition, we prove a strong implication that the working-set property 
is equivalent to the unified bound (which is the minimum per operation 
among the static finger, static optimality, and the working-set bounds). 
This latter result is of tremendous interest by itself as it had gone un- 
noticed since the introduction of such bounds by Sleater and Tarjan [8]. 
Accordingly, our priority queue satisfies other distribution-sensitive prop- 
erties as the static finger, static optimality, and the unified bound. 
Keywords: Data structures, splay trees, priority queues. 



1 Introduction 

Distribution-sensitive data structures are those for which the time bounds to 
perform operations vary depending on the sequence of operations performed [6] . 
These data structures typically perform as well as their distribution-insensitive 
counterparts on a random sequence of operations in an amortized sense; yet 
where the sequence of operations follows a particular distribution, or there is tem- 
poral or spatial locality in the sequence of operations, the distribution-sensitive 
data structures perform significantly better. 

The quintessential distribution-sensitive data structure is splay trees [8]. 
Splay trees seem to perform very efficiently (much faster than 0(\gn) search 
time on a set of n elements) over the sequence of operations that arise natu- 
rally. There still exists no single comprehensive distribution-sensitive analysis of 
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Fig. 1. The implication relationships between various distribution-sensitive properties. 
The implication of the unified bound from the working-set property is a contribution 
of Section 2 of this paper. 



splay trees; instead, there are many theorems and conjectures that character- 
ize the distribution-sensitive properties of splay trees. These properties include 
the static-optimality, static-finger bound, working-set bound, sequential-access 
bound, unified bound, dynamic-finger bound, and the unified- conjecture [1,8]. 
As defined in [8] , the "unified-bound" is the per-operation minimum of the static- 
optimality, static-finger, and working-set bounds. By the "unified-conjecture" , 
we assume the definition given in [1], which subsumes both the dynamic-finger 
and working-set bounds. We refer the reader to [1] and also [8] for a thorough 
definition and discussion of these properties. 

Some of these properties imply others. Figure 1 illustrates such relationship 
between these properties. As a new contribution, we prove in Section 2 the 
implication of the unified bound from the working-set bound. We also show 
that the working-set bound on two sequences is asymptotically the same as the 
working set bound of any interleaving of these sequences; the full proof appears 
in Appendix A. 

Distribution-sensitive data structures are not limited to search trees. Priority 
queues have also been designed and analyzed in the context of distribution- 
sensitivity [2,3,5-7]. It is easy to observe that a priority queue with constant 
insertion time cannot have the sequential-access property (and hence cannot as 
well have the dynamic-finger property), for otherwise a sequence of insertions 
followed by a sequence of minimum-deletions give the elements in sorted order 
in linear time. Therefore, the working-set property has been of main interest for 
priority queues. Informally, the working-set property states that elements that 
have been recently updated are faster to update again compared to the elements 
that have not been accessed in the recent past. Iacono [5] proved that pairing- 
heaps [4] satisfy the working-set property as follows; in a heap of maximum size 
n, it takes O (lg min {n Xl n}) amortized time to delete the minimum element 



x, where n x is the number of operations performed since x's insertion. Funnel- 
heaps are I/O-efficient heaps for which it takes O (lgmin{i x + 2,n}) to delete 
the minimum element x, where i x is the number of insertions made since x's 
insertion (we note that i x < n x ). Elmasry [3] gave a priority queue supporting 
delete- min in O (lg(w x + 2)) worst-case time, where w x is the number of elements 
inserted after x and are still present in the priority queue (w x < i x < n x ) . 

None of these results supports delete with the working-set bound. In sec- 
tion 4, we present a priority queue that supports both delete and delete-min in 
O (\g(w x + 2)) worst-case time (and insertion in worst-case constant time). 

One natural sequence of operations in a data structure is a first-in first-out 
type of updates. Data structures sensitive to these sequences must operate fast 
on elements that have been least recently accessed. This distribution-sensitive 
property is referred to as the "queueish" property in [7] . In the context of priority 
queues, such property states that the time to perform delete or delete-min on an 
element x is O (lg (q x + 2)), , where q x is the number of elements inserted before 
x and are still present in the priority queue. Note that q x = n — w x , where n is 
the number of elements currently present in the priority queue. However, it is 
shown in [7] that no binary search tree can be sensitive to this property. Albeit, 
a priority queue with the queueish property is presented in the same paper. 

It remained open whether there exists a priority queue sensitive to both the 
working-set and the queueish properties. Wc resolve the question affirmatively 
by presenting such a priority queue in Section 5. This priority queue is the most 
comprehensive distribution-sensitive priority queue to date. 

In Section 6, we present a more powerful priority queue that incorporates 
multiple time fingers. We define time fingers t\, t%, . . . , t c as points of time during 
the sequence of updates, which are set on-line as they arrive. Wc define the 
working-set of an element x with respect to time finger tj, w x (ti), as the number 
of elements that have been inserted in the window of time between the insertion 
of x and U and are still present in the priority queue. We say a priority queue 
satisfies the multiple time finger property if the time to delete or delete-min x is 



O I \g{vtnn{w x (ti)} + 2) ] . It is not hard to see that the working-set property is 



equivalent to having a single time finger of t\ = +oo, and the queueish property 
is equivalent to having a single time finger of ti — 0. The priority queue presented 
in Section 5, which supports both the working-set and queueish properties, is 
equivalent to having two time fingers t\ = 0, ti = +oo. In Section 6, we present a 
priority queue that satisfies the property for a constant number of time fingers. 



2 From the working-set bound to the unified bound 




The static finger property states that, for any fixed element / (the finger), the 
amortized time to access an element a is O (\g(d(a, /) + 2)), where d(a,f) is 
the rank difference between a and /. More specifically, for a sufficiently long 



sequence of accesses xi,x 2 , ■ ■ ■ ,x m , the total access time is 



O Jlg(d(i j ,/)+2) 




The static optimality property (entropy bound) states that, for a sequence of 
accesses Xi,x 2 ■ ■ ■ ,x m , where the element corresponding to Xi is accessed q{xi) 
times in the entire sequence, the total access time is 



The working set Wx(i) of an operation Xi in sequence X, which accesses 
element a;,, is defined as the number of distinct items accessed since the last 
access to Xi. The working-set property states that, for a sufficiently long sequence 
of accesses x\, x 2 . . . , x m , the total access time is 



We observe that the working-set bound of two sequences does not asymptot- 
ically change when those two sequences are arbitrarily interleaved. We state this 
theorem formally below and prove it in Appendix A. This result will be needed 
to prove the main claim of this section, and is interesting in its own right. 

Theorem 1. Let X be a search sequence and let Y and Z be two subsequences 
of X that partition X. Stated another way, X is an interleaving of Y and Z. 
Then, 



Iacono [6] observed that the working-set property implies the static-optimality 
and static-finger properties. Therefore, the working-set property is the strongest 
of the three properties. The unified bound indicates an apparently stronger prop- 
erty than the three properties. The unified bound states that the total time for 
a sufficiently long sequence of accesses X = x\, x 2 ■ ■ ■ , x m and any fixed finger / 



However, we show next that the working-set property is asymptotically as 
strong as the unified property. 

Theorem 2. The working-set bound is asymptotically equivalent to the unified 
bound. 
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Proof. Clearly, the unified bound implies the working-set bound as 



^lgmin<U^,/) + 2, + 1, «,(») + 2 I < E lg(w(i) + 2). 

Let y be the subsequence of X — x±, X2, ■ ■ ■ x m consisting of those elements 
x t where w(i) + 2 = min [d(x t , f) + 2, + 1, w(i) + 2}, and let F be the 

subsequence of X crated by removing Y. Let m(i) = j if yi is a:,, and let 
= j if |/j = Xi. We also subscript the w(-), d(-,-) and </(•) to explicitly 
indicate which sequence these measures are with respect to. Then: 
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Equation 3 splits one sum into two using the partitioning of X into Y and Y. 
The left term of Equation 4 is obtained by replacing wx with toy , which only can 
cause a decrease. The right sum of Equation 4 is a static-optimality type formula, 
which is asymptotically the same as the right sum of Equation 3. This has the 
property that if y,- t = yj, the zth and jth term of this sum are identical, and we 
call this value the weight of an element. The 0(1) numerator is chosen so that 
the weights sum to 1. Since this is just a static-optimality type weighting scheme, 
this sum is at least the entropy of the frequencies, and this observation leads 
to Equation 5. To get from Equation 5 to Equation 6 the fact that the static- 
optimality bound is big-Omega of the working-set bound; this is Theorem 10 
of [6]. Moving from Equation 6 to Equation 7 requires the observation that the 
sum of the working-set bounds of two sequences of operations are asymptotically 
the same as the working-set bound of the interleaving of two sequences; this is 
stated above and proved in Appendix A as Theorem 6. 



Fig. 2. The recursive structure of a (2, 3) binomial tree of rank r: Subtrees rooted at 
the children of the root are (2, 3) binomial trees of ranks 0, 1, . . . , r — 1. The sequence 
of ranks forms a non-decreasing sequence from right to left such that each value from 
0, 1, . . . , r — 1 occurs either once or twice. 

3 A priority queue with the working-set property 

Our priority queue builds on the priority queue in [3], which supports insertion 
in constant time while the minimum deletion fulfills the working-set bound. The 
advantage of the priority queue in [3] over those in [2, 4, 5] is that it satisfies the 
stronger working-set property in which elements that are deleted do not count 
towards the working sets. Next, we outline the structure of this priority queue. 

The priority queue in [3] comprises heap-ordered (2, 3) binomial trees. As 
defined in [3], the subtrees of the root of a (2,3) binomial tree of rank r are 
(2, 3) binomial trees; there are one or two children having ranks 0, 1, . . . , r — 1, 
ordered non-decreasing from right to left. It is trivial to verify that the rank r 
of an n-node (2,3) binomial tree is 0(\gn). Figure 2 illustrates the recursive 
structure of a (2,3) binomial tree. 

The ranks of the (2, 3) binomial trees of the priority queue are as well non- 
decreasing from right to left. For the amortized solution, there are at most two 
trees per rank. The main obstacle against achieving the bounds in the worst 
case is the possibility that a long sequence of consecutive ranks would have no 
corresponding trees. To overcome this problem, in the worst-case solution, the 
number of trees per rank obey an extended-regular number system that imposes 
stronger regularity constraints, which implies that the ranks of any two adjacent 
trees differ by at most 2 (see [3] for the details). 

The root of every (2, 3) binomial tree has a pointer to the root with the 
minimum value among those to the left of it. Such prefix-minimum pointers 
allow for finding the overall minimum element in constant time, with the ability 
to maintain such pointers after deleting the minimum in time proportional to 
the rank of the deleted node. Figure 3 illustrates how various (2, 3) binomial 
trees constitute the priority queue. 



Fig. 3. (2, 3) binomial trees comprise the priority queue: The rank of the trees are non- 
decreasing from right to left. This figure illustrates the amortized solution where there 
are at most two trees per rank. In the worst-case solution, the number of trees of each 
rank follows a much stricter number system. Prefix-minimum pointers are maintained 
at the root of the trees. Each tree root points to the minimum root to the left of it. 



A total order is maintained indicating the time the elements were inserted. 
We impose that across binomial trees, if binomial tree T\ is to the right of another 
T2, then all elements in T\ have been inserted after those in T2. Furthermore 
within an individual binomial tree, the preorder ordering of elements with a 
right-to-left precedence to subtrees must be chronologically consistent with the 
insertion time of these elements. When performing operations, we occasionally 
disobey this ordering by reversing the order of two entire subtrees. We mark 
these points by maintaining a reverse bit with every node x; such reverse bit 
indicates whether the elements in x's subtree were inserted before or after the 
elements in x's parent and those in the descendants of the right siblings of x. 

Two primitive operations are split and join. A tree of rank r is split to two or 
three trees of rank r — 1; this is done by detaching the one or two children of the 
root having rank r — 1. On the other hand, two or three trees of rank r — 1 can 
be joined to form a tree of rank r; this is done by making the root(s) with the 
larger value the leftmost child(ren) of the other, and setting the reverse bit(s) 
correctly. To join a tree of rank r — 1 and a tree of rank r — 2, we split the first 
tree then join all the resulting trees; the outcome is a tree which has rank either 
r — 1 or r. With these operations in hand, it is possible to detach the root of a 
(2, 3) binomial tree of rank r and reconstruct the tree again as a (2, 3) binomial 
tree with rank r — 1 or r; this is done by repeated joins and splits starting from 
the rightmost subtrees of the deleted root to the leftmost (see [3] for the details). 

To insert an element, a new single node is added as the rightmost tree in the 
priority queue. This may give rise to several links once there are three trees with 
the same rank; the number of such links is amortized as a constant, resulting in 
the constant amortized cost per insertion. After every link, the prefix-minimum 
pointer of the surviving root may need to be updated. For the worst-case solution, 
the underlying number system guarantees at most one join per insert. 

To perform delete-min, the tree T of the minimum root is identified via the 
prefix-minimum pointer of the rightmost root, the tree T is reconstructed as a 
(2, 3) binomial tree after detaching its root. This may be followed by a split and 



a join if T has rank one less that its original rank. Finally, the prefix-minimum 
pointers are updated. For the amortized solution, several splits of T may follow 
the delete-min operation. Starting with T, we repeatedly split the rightmost tree 
resulting from previous splits until such tree and its right neighbor (the right 
neighbor of T before the delete-min) have consecutive ranks; this splitting is 
unnecessary in the worst-case solution. It is not hard to conclude that the cost 
of delete-min is O(r), where r is rank of the deleted node. In the worst-case 
solution, the rank of the deleted node x is 0(\g(w x + 2)). For the amortized 
solution, an extra lemma would prove the same bound in the amortized sense. 

4 Supporting delete within the working-set bound 

The existing distribution-sensitive priority queues [2, 3, 5-7] do not support delete 
within the working-set bound. In this section, we modify the priority queue out- 
lined in section 3 to support deletion within the working-set bound. 

Including delete in the repertoire of operations is not hard but should be done 
carefully. The major challenge is to correctly maintain the total order imposed 
by the reverse bits following deletions. 

We start by traversing upwards via the parent pointers from the node x to- 
be-deleted until the root of the tree of x is reached. Then starting at this root, 
the current subtree is repeatedly split into two or three trees, one or two of them 
are pushed to a stack while continuing to split the tree that contains x, until we 
end up with a tree whose root is x. At this stage, we delete x analogously to 
the delete-min operation; the node x is detached and the subtrees resulting from 
removing x are incrementally joined from right to left, while possibly performing 
one split before each join (similar to the delete-min). 

We now have to work our way up to the root of the tree and merge all 
subtrees which we have introduced by splits on the way down from the root. 
The one or two trees that have the same rank are repeatedly popped from the 
stack and joined with the current tree, while possibly performing one split before 
each join (as required for performing a join operation). Once the stack is empty, 
a split and a join may be performed if the resulting tree has rank one less that 
its original rank (again analogously to the delete-min operation). 

The total order is correctly maintained by noting that the only operations 
employed are the split and join, which are guaranteed to set the reverse bits 
correctly [3]. Since the height of a (2,3) binomial tree is one plus its rank, the 
time bound for delete is 0(r), where r is the rank of the tree that contains the 
deleted node. This estqablishes the same time bound as that for delete-min in 
both the amortized and worst-case solutions (see [3] for the details). 

Theorem 3. The priority queue presented in this section performs find-rain 
and insert in constant time, and both delete and delete-min of an element x in 
O (lg (w x + 2)) time , where w x is the number of elements inserted after x and 
are still present at the time of x 's deletion. 
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Fig. 4. The priority queue satisfying the queueish property: the priority queue com- 
prises of two queues one of which has tree ranks increasing from right to left (right 
queue) and one increasing from left to right (left queue). We also maintain that the 
ranks of the largest trees in the queues must differ by at most one. The prefix-min 
pointers are stored independently in both sides. 

5 Incorporating the queueish property 

The queueish property for priority queues states that the time to perform delete 
or delete- min on an element x is O (lg(n — w x + 2)), where n is the number 
of elements currently present in the priority queue, and w x is the number of 
elements inserted following the insertion of x and are still present. In other words, 
the queueish property states that the time to perform delete or delete-min on an 
element x is O (lg (q x + 2)), where q x — n — w x is the number of elements inserted 
prior to insertion of x and are still present in the priority queue. Queaps [7] are 
queueish priority queues that support insert in amortized constant time and 
support delete-min of an element x in amortized O (lg(q x + 2)) time. 

We extend our priority queue with the working-set bound to also support 
both delete and delete-min within the queueish bound. Accordingly, the priority 
queue simultaneously satisfies both the working-set and the queueish properties. 
Instead of having the ranks of the trees of the queue non-decreasing from right 
to left, we split the queue in two sides, a right queue and a left queue, forming 
a double-ended priority queue. The ranks of the trees of the right queue are 
monotonically non-decreasing from right to left (as in the previous section), and 
those of the left queue are monotonically non-decreasing from left to right. We 
also impose the constraint that the difference in rank between the largest tree 
on each side is at most one. Figure 4 depicts the new priority queue. 

The prefix-minimum pointers in the left and right queues are kept indepen- 
dently. In the right queue, the root of each tree maintains a pointer to the root 
with the minimum value among those in the right queue to the left of it. Con- 
versely, in the left queue, the root of each tree maintains a pointer to the root 



with the minimum value among those in the left queue to the right of it. To find 
the overall minimum value, both the left and right queues are probed. 

Insertions are performed exactly as before in the right queue. The delete- 
min operation is performed in the left or right queue depending on where the 
minimum lies. Deletions are also performed as mentioned in the previous section. 

However, we must maintain the invariant that the difference in rank between 
the largest tree in the left and right sides is at most one. Since the total order 
is maintained among our trees, this invariant guarantees that the rank of the 
tree of an clement x is O (lg(min {w^, q x } + 2)). As a result of an insertion or a 
deletion, the difference in such ranks may become two. Once the largest rank on 
one side is two more than that on the other side, the trees with such largest rank 
are split each in two or three trees, and the appropriate tree among the resulting 
ones is moved to the other side, increasing the largest rank on the second side 
by one. As a result of those splits, the number of trees of the same rank on 
the first side may now exceed the limit, and hence a constant number of joins 
would be needed to satisfy the constraints. Once a tree is moved from one side 
to the other, the prefix-minimum pointers of the priority queue on the second 
side need to be updated. Because such action happens only after a lot (linear) 
number of operations, updating the prefix-minimum pointers only accounts for a 
constant extra in the amortized cost per operation. If we want to guarantee the 
costs in the worst case, updating those prefix-minimum pointers is to be done 
incrementally with the upcoming operations. 

A deletion of a node a; in a tree of rank r would still cost 0(f) time, but now 
r = 0(lg (min {w x , q x } + 2)) in the amortized sense (for the amortized solution) 
or in the worst-case sense (for the worst-case solution). 

Theorem 4. The priority queue presented in this section performs find-min 
and insert in constant time, and both delete and delete-min of an element x in 
O (lg(min {w x , q x } + 2)) time, where w x and q x are the number of elements in- 
serted after, respectively before, x and are still present at the time ofx's deletion. 

6 Supporting multiple time fingers 

In this section, we introduce a new distribution-sensitive property, which encap- 
sulates both the working-set and the queueish properties. We refer to this prop- 
erty as the multiple time-fingers property. Time fingers ti, t 2 , ■ ■ ■ , t c are points 
of time during the sequence of updates which are set and fixed as they arrive. In 
other words, as the time progresses with the sequence of operations at multiple 
occasions the user can specify the time being as a time finger. The elements 
inserted in the temporal vicinity of these time-fingers must be accessible fast. 

We define, w x (ti), the working-set of an element x with respect to time finger 
ti, as the number of elements that have been inserted in the window of time 
between the insertion time of x and time tj and are still present in the priority 
queue at the time of x's deletion. We say a priority queue satisfies the multiple 
time-finger property if the time to perform delete or delete-min operations on 



an element x is (3(lg(min {w x (ti)} + 2)). Clearly, the priority queue we have 

i=l 

presented so far corresponds to a priority queue with the time-finger property 
for two time fingers of t\ = (the queueish property) and t 2 = +00 (the working- 
set property). In this section, we present a priority queue that satisfies the time- 
finger property for any constant number of time fingers. 

The structure consists of multiple double-ended priority queues of Section 5. 
We start with a single copy of a double-ended priority queue PQo at the be- 
ginning and at each point when a new time finger is introduced we finalize the 
priority queue and start a new one. Therefore, corresponding to c time-fingers 
t\ = 0, . . . ,t c = 00, we have c— 1 double-ended priority queues PQi, ■ ■ ■ , PQ c -i- 

Insertions are performed in the last (at the time when the insertion is per- 
formed) priority queue, and by Theorem 4 take constant time. For delete op- 
erations, we are given a reference to an element x to delete, we determine 
to which priority queue PQj the element belongs and delete it. This requires 
O (lg(min \w x (tj), w x (tj + i)} + 2)) time, as indicated by Theorem 4. Since x be- 
longs to PQj, for any i < j, w x (tj) < w x (ti), and for any i > j + 1, w x (tj + i) < 

c 

w x (ti). It follows that lg(min {w x (tj), w x (tj+i)} + 2) = lg(min {w x (ti)} + 2). 

i— 1 

For delete-min operation, it suffices to note that the find-min operation takes 
constant time in the double-ended priority queue of Theorem 4. Therefore, we 
can determine in constant time which priority queue contains the minimum and 
perform the delete-min operation in there. The running time argument is the 
same as that for the delete operation. 

Theorem 5. Given a constant number of time fingers t\ = 0, t2, ■ ■ ■ , t c = 00, the 
priority queue presented in this section performs find-min and insert in constant 

c 

time, and both delete and delete-min of an element x in 0(lg(min {w x (U)} + 2)) 

i— 1 

time, where w x (ti) is the number of elements that have been inserted in the 
window of time between the insertion time of x and time ti and are still present 
at the time of x 's deletion. 

7 Conclusion and future work 

We presented a hierarchy of distribution-sensitive properties in Figure 1. We es- 
tablished that the working-set property is equivalent to the unificd-bound prop- 
erty. The queueish property introduced by [7] is missing from the picture as it is 
neither derived by or implies any other property. Nevertheless, we argued that 
it is a very natural distribution-sensitive property. 

We considered the case of distribution-sensitive priority queues. Precisely 
speaking, we designed a priority queue that supports insertions in constant time 
and dclctc-min and delete operations in distribution-sensitive bounds. Provably, 
priority queues cannot satisfy the sequential-access property and in accordance 
neither the dynamic-finger nor the unified properties. We therefore focused on 
other distribution-sensitive properties, namely: the working-set and the queueish 
properties. We presented a priority queue that satisfies both properties. Our 



priority queue build on the priority queue of [3], which supports insertion in 
constant time and delete-min in the working-set time bound. We showed that the 
same structure can also support delete operations within the working-set bound. 
We then modified the structure to satisfy the queueish property as well. It is 
worthy to note that the priority queue designed supports the stronger definition 
of the working-set and the queueish properties in which the elements deleted do 
not influence the time bounds. 

Our result about the equivalence of the working-set property and the unificd- 
bound property then implies that our priority queue also satisfies the unified- 
bound, static-optimality and static-finger properties. 

We defined the notion of time fingers, which encapsulate the working-set and 
the queueish properties. The priority queue described thus far corresponds to a 
priority queue that supports two time fingers. We extended the support to any 
constant number of time fingers. 

The bounds mentioned are amortized. However, we showed that the time 
bounds for the working-set and queueish properties can also be made to work in 
the worst case. More generally, the multiple time-finger bounds can be made to 
work in the worst case. However, the time bounds for other properties: unified 
bound, static optimality, and static finger naturally remain amortized. 

As for future work, one key operation is still missing from the supported 
repertoire of operations; that is the decrease-key operation. We leave open the 
question of whether decrease-key operations can also be performed in constant 
time while supporting the distribution-sensitive bounds for delete operations. 
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A The effect of merging sequences on the working-set 
bound 



Theorem 6. Let X be a search sequence and let Y and Z be two subsequences 
of X that partition X. Stated another way, X is an interleaving of Y and Z. 
Then, 

\X\ (\Y\ \Z\ \ 

^lg( W x« + 2) = 6> + 2) + E "6(^(0 + 2 ) ■ 

i=l \i=l i=l / 

Proof. The J? direction is immediate and thus we focus on the O direction. 

Let mx{i) = j if Ui corresponds to x m ^ in the subsequence relationship of Y 
with regard to X. Let my and all subsequent notion in this section subscripted 
on Y have an analgous definition on Z. 

Let uJx{i) be the largest j < i such that Xi — Xj. That is, x ux a\ is the 
previous access to the operation Xi in X. Let Wx(i) be the set of indicides j 
such that Xj is the first occurence of that element in the range x ux ^ + ..Xi-i. 
This definition is such that Wx{i) is the set envisioned by the concept of the 
"working set" of x% and is constructed so that |Wx(i)| = w x (i). 

Observe that the largest j in Wx{i) has the property that its working set 
number is at least Wx(xi) — 1 since it is the first occurrence of the value Xj where 
j € Wx(i), and there are |Wx(i)| — 1 = w x {xi) — 1 different before it. In general, 
the fcth largest j in W x (i) has a working set number at least w x (xi) — j. Let 
W' x be the |Wx(i)|/2 largest elements of W x . This all ||tyx(i)| elements j of 
W' x have working set number at least \wx{i)- 

Let Ai be formally defined to be: 

M = {i\wx{m Y (j)) > w Y (j) 2 and i = [lg w x {m Y {j))\ } 

That is, it consists of the indicies of those elements in Y whose logarithm of 
its working set number is double in X relative to Y and where it is in the range 
[2 2 ' .. ■ 2 T ). All sets Ai are thus disjoint, and all indicies i in Vj] are in some 
set Ai unless the log of the working set of the element y» does not change by 
more than a factor of two as a result of the merge with Z. 

Now, pick some element j £ Ai. Recall that Wx(i) is the working-set of Xi 
in X, and W' x (i) is half of the elements of Wx(i)- Some of these elements come 
from Y, and some from Z. However, the vast majority come from Z, since the 
total number is at least the number from Y squared. Very conservatively, at least 
half of the elements in W'(i) have some j such that mz(j) = i. We say that 
these elements of Z are covered by the element j at level i; this set is represented 
by Ci(j) and is defined as follows, and then bounded: 



Ci(j) = {k\m z (k) e W' x (j)} 



2 2 ' +1 > wx{j) = \W x (j)\ > \Ci(j)\ > \\W' x (j)\ = \\W x (j)\ = \w x (j) > V 

By construction, for any fixed k, there are only at most 2 2>+1 elements j such 
that Ci(j) — k. Thus the size of the the covered set of the union of all Ci(j) 
where j <G is we denote this set as Cj. So, therefore: 

oo \Z\ 

^|C 4 |2 l <^21og Wz (^) 

4 = 1=1 

This is because for each Zj, contributes at most to each \d\ where wz(zi) < 
2 2%+1 and is not part of any d when wz(zi) > 2 2 ' +1 . 
Putting this information together gives: 



\y\ \y\ / oo \ 

J2w x (x m{{) )-J2w Y (x m{i) ) = e \Y\ +J2 W 

i=l i=l V i=0 / 

= 0^|F|+£|a|2^ 

Taking this equation, linearly combining it with the symmetric version where 
Y and Z are transposed, and noting that \Y\ and \Z\ are lower-order terms in 
the resultant equation yields the claim of this Theorem. 



